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BACKGROUND! The ovarian reserve (number and quality of oocytes) is correlated with reproductive potential as well as somatic health, 
and is likely to have multiple genetic and environmental determinants. Several reproductive hormones are closely linked with the oocyte pool 
and thus can serve as surrogate markers of ovarian reserve. However, we know little about the underlying genes or genetic variants. 

METHODS! We analyzed genetic variants across the genome associated with two hormonal markers of ovarian reserve, FSH and anti- 
Mullerian hormone, in a reproductively normal population of Caucasian (n = 232) and African American (n = 200) women, aged 25-45 
years. We also examined the effects of environmental or lifestyle factors on ovarian reserve phenotypes. 

RESULTS: We identified one variant approaching genome-wide significance (rs6543833; P= 8.07 x I0~ 8 ) and several nominal variants 
nearby and within the myeloid-associated differentiation marker-like (MYADML) gene, that were associated with FSH levels in African American 
women; these were validated in Caucasian women. We also discovered effects of smoking and oral contraceptive use on ovarian reserve 
phenotypes, with alterations in several reproductive hormones. 

CONCLUSIONS: This work is the largest study on ovarian reserve in women of reproductive age and is the only genome-wide study on 
ovarian reserve markers. The genes containing or near the identified variants have no known roles in ovarian biology and represent inter- 
esting candidate genes for future investigations. The discovery of genetic markers may lead to better long-range predictions of declining 
ovarian function, with implications for reproductive and somatic health. 
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Introduction 

Infertility is a significant problem in society, affecting 15% of 
reproductive-aged couples (Hull eta/., 1985; Menken and Larsen, 1994; 
Reijo et at., 1995). Many cases of infertility appear to be the result of 
defects in germ cell development that lead to reduced numbers or 
quality of gametes. The quantity and quality of oocytes within the 
ovaries, or ovarian reserve, are closely correlated with reproductive 



potential and are likely to have multiple genetic and environmental deter- 
minants. Further, ovarian reserve is affected by aging and disease, and is 
closely tied to both reproductive and somatic health. Many diseases 
including osteoporosis, cardiovascular disease and various cancers are 
associated with loss of oocytes/follicles, ovarian dysfunction and/or the 
length of the female reproductive lifespan (Cooper and Sandler, 1 998; 
Hartge, 2009). Hence, better prediction of both normal and aberrant 
oocyte depletion would have significant clinical implications. 
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Several reproductive hormones are closely correlated with the 
ovarian reserve and the inevitable process of follicle depletion. 
Current methods for measuring ovarian reserve and predicting age 
at fertility loss and menopause include assessment of serum levels of 
FSH, inhibin B and anti-Mullerian hormone (AMH), which decreases 
~I0 years prior to menopause (van Rooij et a/., 2004). AMH and 
FSH are direct and indirect reflections of follicle number, respectively. 
AMH is a member of the transforming growth factor-(3 superfamily; it 
suppresses the cyclical recruitment of primordial follicles into the pool 
of growing follicles and is specifically secreted by the granulosa cells of 
pre-antral and antral follicles in the ovary (Durlinger et a/., 2002). Thus, 
AMH is present in proportion to the number of antral follicles. FSH on 
the other hand, is an indirect reflection of the follicle pool; it is a 
gonadotrophin secreted by the anterior pituitary in response to hypo- 
thalamic release of GnRH. It stimulates follicular growth and is a key 
component of the brain-pituitary-ovary axis (Gromoll and Simoni, 
2005). FSH is sensitive to the inhibitory influence of inhibin 
B. Inhibin B is secreted by the granulosa cells of developing and 
early antral follicles. So as the number of follicles declines, so do 
inhibin levels and FSH begins to rise. Thus, there is an association 
between high FSH and low follicle number with ovarian aging. 

Antral follicle count (AFC), transvaginal ultrasound-guided assess- 
ment of follicles, is routinely performed during fertility assessments. 
We have recently analyzed the correlations of several hormonal 
markers and AFC with age in Caucasian women (unpublished data), 
finding that only AFC and AMH parallel the decline of follicles with 
age noted in histological ovarian specimens (Hansen et al., 2008). In 
contrast, FSH shows an inverse relationship with AFC and follicle 
number in vivo; as follicle number and AMH decline, FSH rises. It is note- 
worthy that both AMH and FSH levels are highly variable among 
women, even measured at the same point in the menstrual cycle. 

The age of menopause has a strong heritability (Snieder et al., 1 998; 
de Bruin et al., 200 1 ; Murabito et al., 2005). Ovarian aging (OVA), which 
leads to menopause with the exhaustion of the follicular pool, also 
seems to have a genetic component (Rosen et al., 2010a). Genetic 
markers associated with ovarian reserve may have the advantage of 
providing better long-range predictions of risk for declining ovarian 
reserve. Thus, determining the genetic associations of rising FSH and 
declining AMH may allow for improved counseling regarding reproduct- 
ive health and aging. 

Most previous work has focused on measuring the reproductive events 
of menarche and menopause, or comparing ovarian reserve markers and 
their ability to predict reproductive outcomes in the context of infertile 
populations or small observational studies. Moreover, few studies have 
investigated the genes or variants associated with specific reproductive 
hormones. Therefore, we sought to identify genes and variants across 
the genome associated with hormonal markers of ovarian reserve in a 
multi-ethnic cohort recruited from the general population. We also 
sought to examine the effects of environmental or lifestyle factors includ- 
ing smoking and oral contraceptive use, on ovarian reserve phenotypes. 

Materials and Methods 

Study population 

The final study population included 232 Caucasian and 200 African 
American women, aged 25-45 years, who were prospectively enrolled 



in a community-based cohort of the OVA Study. These women were 
not seeking treatment for infertility or other medical problems. The 
OVA Study comprises a multi-ethnic cohort and is focused on elucidating 
clinical and genetic markers of OVA. Subjects were recruited from poten- 
tially eligible female members of Kaiser Permanente (KP) Northern 
California, who resided within proximity to the Women's Health Clinical 
Research Center at the University of California San Francisco (UCSF). 
The socio-demographic and health-related characteristics of the KP 
patient population are representative of the population of Northern Cali- 
fornia (Gordon, 2006), and the final OVA participants were similar to the 
KP population. Subjects were included if they were ovulatory with normal 
predictable menstrual cycles of 22-35 days. Women with oligo- or ano- 
vulation, surgically diagnosed endometriosis, ovarian failure (loss of all fol- 
licles before the age of 40 years) or a history of uterine or ovarian 
surgery were excluded. Subjects were also excluded if they had taken 
oral contraceptives or medications containing estrogen or progestin 
that alter the menstrual cycle within the 3 months prior to enrollment. 
This work was approved by the Institutional Review Boards at KP, 
UCSF and Stanford University, and informed consent was obtained for 
all subjects. 

Phenotypes and covariates of ovarian reserve 

Blood samples were collected by venipuncture on Day 2-4 of the 
menstrual cycle and used for all serum hormonal assays and genotyping. 
FSH and LH levels were measured with standardized two-site chemilu- 
minescence immunoassays. The intra-assay coefficients of variation 
(CVs) were 1.9-2.1%, whereas the inter-assay CVs were 5.2-6.8%. 
AMH was measured using a commercially available ELISA, which uses a 
direct competitive immunoassay (Beckman Coulter, Marsee, France). 
The AMH CVs for intra- and inter-assays were <5.6 and <I5.3%, 
respectively. Estrone (E|) and inhibin B were measured using a commer- 
cially available ELISA (Diagnostic Systems Laboratories, Webster City, 
TX, USA). The intra-assay CVs for E| and inhibin B were 4.8-9.9 and 
3.3-7.2%, respectively, whereas the inter-assay CVs were 7.7-13.4 
and 7.8- 1 7%, respectively. Estradiol (E 2 ) and testosterone were mea- 
sured with an automated chemiluminescent assay using the Bayer Diag- 
nostics ACS: 1 80 (Bayer Diagnostics Corp., Tarrytown, NY, USA). The 
intra- and inter-assay CVs for E 2 were 6.5-6.9 and I 3.6- 1 6. 1%, respect- 
ively. For total testosterone, the intra-assay CVs were 2.9-4.0% and the 
inter- assay CVs were 6.3-6.6%. 

AFCs were performed as previously described (Rosen et al., 2010b). 
Briefly, all subjects underwent transvaginal ultrasound assessment of 
ovarian volumes and AFC on Day 2-4 of their menstrual cycle. All follicles 
with a mean diameter (of two dimensions) of 2- 10 mm were counted; all 
echo-free structures were regarded as follicles (Supplementary data, Fig. 
SI). To eliminate experimental variability all women were examined at 
the same time in their menstrual cycle, by one of two physicians, using 
the same equipment, and unclear ultrasound results were excluded. Inter- 
examiner concordance exceeds 90%. 

All subjects completed a comprehensive questionnaire to obtain 
detailed health, reproductive, lifestyle and demographic information, as 
previously described (Rosen et al., 2010b). The age of each woman 
was rounded off to the nearest 1 0th of a year. Body measurements 
including weight, height and waist and hip circumference, were also 
obtained. Parity or the number of children born to each woman was 
also assessed. Cigarette smoke exposure was quantified from responses 
to questionnaire data and categorized as 'never, past or current' smoker 
and pack-years of smoking. Previous history of taking estrogen- and/ or 
progestin-containing oral contraceptive pills (OCPs) was also quantified 
from questionnaire data and categorized as 'never or past' and total 
months of OCP use. 
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Genotyping and quality control 

Genotypic data for the Caucasian and African American women was 
obtained using the Genome-Wide Human SNP Array 6.0 (Affymetrix, 
Santa Clara, CA, USA). This microarray contains more than 900 000 
single nucleotide polymorphism (SNP) markers and 940 000 copy 
number variant (CNV) markers; approximately one marker every 700 
bases across the genome (Affymetrix, 2007). Genomic DNA was 
extracted and purified from white blood cells using the QIAamp DNA 
Blood Maxi Kit (Qiagen, Venlo, The Netherlands) according to the man- 
ufacturer's instructions. The DNA ligation, hybridization and microarray 
scanning steps were completed according to the manufacturer's protocols 
(Affymetrix). 

Stringent quality control (QC), concordance checks and SNP filtering 
were employed using Genotyping Console (GTC) v.4.0 software (http:// 
www.affymetrix.com) to eliminate poor samples and poorly performing or 
rare SNPs. The QC measures used were: QC call rate or percentage of a 
specific set of 3022 control SNPs genotyped or 'called' in the samples 
using the dynamic algorithm (filtered for QC call rate >86%); and contrast 
QC, which is an estimate of the ability to resolve SNP signals into three 
genotype clusters (filtered for contrast QC >0.4). Contrast QC, the 
most important QC measure, uses a static set of 10 000 randomly 
chosen SNPs and quantifies the difference between the highest and 
lowest contrast values of the homozygote and heterozygote genotypes. 
Laboratory technical QC criteria also included SNP fingerprints for 
sample tracking and early detection of sample misidentification, the use 
of HapMap controls to check genotype quality, gender confirmation 
based on mean intensities of SNP probes on the X and Y chromosomes, 
and tracking of reagent and instrument performance. In the Caucasian 
group, of the final 232 samples that passed QC and filter criteria, the 
mean contrast QC was 1 .87 + 0.046, the QC call rate was 93.9 + 
0.2% and the overall sample call rate was 98.0 + 0.08%. In the African 
American group, of the final 200 samples, the mean contrast QC was 
2.23 + 0.036, the QC call rate was 97.0 + 0.1% and the overall sample 
call rate was 99.4 + 0.08%. SNPs were genotyped using GTC v.4.0, 
using the Birdseed v.2.0 algorithm (Korn et ai, 2008), which is an exten- 
sion of the Bayesian Robust Linear Model with Mahalanobis Distance using 
perfect-match probes. Clustering analysis and examination of relative 
sample intensity, as well as filtering for high sample call rates (>95%) 
ensured high sample quality prior to downstream association analysis. Indi- 
vidual SNPs across the genome were filtered for SNP call rates >95%, 
missing call rates <5%, minor allele frequency (MAF) >l% and lack of 
significant deviation from Hardy-Weinberg equilibrium (P > 0.007). Of 
909 622 SNP probes on the array, 677 261 and 738 185 in the Caucasian 
and African American cohorts, respectively, passed QC and SNP filter 
criteria and were independently tested for association with FSH and 
AMH. Using the 940 000 CNV probes on the array, future investigations 
will examine CNVs across the genome and their association with hormo- 
nal and clinical markers of ovarian reserve. All SNP locations are refer- 
enced to National Center for Biotechnology Information (NCBI) human 
genome build 37 (February 2009) and dbSNP build 131 (April 2010) 
(http://www.ncbi.nlm.nih.gov/projects/SNP). Gene names provided in 
all tables and text represent the HUGO Gene Nomenclature Committee 
(HGNC) gene symbols of the gene containing, or closest to, the identified 
SNP (http://www.genenames.org/). 

Determination of population structure 

For ethnicity validation and tests of population structure a principal 
component analysis (PCA) using singular value decomposition (svd) was 
performed. Subjects of the HapMap CEU (Utah residents with N. and 
W. European ancestry from the CEPH collection) and YRI (Yoruban in 
Ibadan, Nigeria) cohorts were genotyped using the same genotyping 



platform (http://hapmap.ncbi.nlm.nih.gov/) (International HapMap 
Consortium, 2005). The OVA cohorts were combined with the 
HapMap CEU and YRI cohorts, and missing genotypes were imputed 
using the program BEAGLE v.3.0.2 (http://www.stat.aukland.ac.nz/ 
-bbrowning/beagle/beagle) (Browning, 2006; Browning and Browning, 

2007) . Genotypes at 10 000 SNPs, selected to have low levels of linkage 
disequilibrium (LD) and MAF >0.0I, distributed across several chromo- 
somes were coded as 0, I or 2 (AA, AB and BB) and were mean- 
centered. PCA was performed using the 'svd' function of the statistical 
program R v.2. 1 I.I (http://www.r-project.org/) (Gentleman and lhaka, 

2008) to analyze allele composition and population homogeneity (Supple- 
mentary data, Fig. S2). 

Statistical analysis 

Phenotypic and covariate analysis 

Descriptive statistics were calculated for all hormonal, environmental and 
demographic variables in the study cohorts. The relationship between 
hormonal variables, follicle number and age was analyzed in linear regres- 
sion analyses. For binary variables including smoking and OCP use, linear 
regressions of the hormone concentration versus age were compared 
between women that currently or previously smoked or used OCPs, 
and those that did not. Statistical differences between the slopes and inter- 
cepts of the regressions were analyzed using the F-test for variable inclu- 
sion with subject age mean-centered. P < 0.05 was considered statistically 
significant. Three models with different effects of smoking (or OCP use) 
were tested against the simple null model of an association between the 
hormone alone (hormone ~ Age). The three models tested were: (I) 
smoking changes the intercept (hormone ~ Age + Smoking); (2) the 
slope (hormone ~ Age + Smoking x Age) or (3) both slope and intercept 
(hormone ~ Age + Smoking + Smoking x Age). For these analyses, 
women that previously smoked were grouped with smoking women, as 
we reasoned if a woman ever smoked, there may have been irreversible 
effects on her ovarian reserve. Similar analyses were performed on the 
associations between follicle number and smoking or OCP use. Pack-years 
of smoking and total months of OCP use were also analyzed for their 
association with serum hormone levels, with control for age. Outliers 
were removed based on sensitivity analyses. Mean total serum hormone 
levels or AFCs were also compared between smoking (or OCP-using) 
and non-smoking (or non-OCP-using) women, using the Student's t-test 
or Welch 2-Sample t-test within R v.2. 1 I . I (Gentleman and lhaka, 

2008) . Covariates, including smoking, OCP use and follicle number, that 
were significantly associated with FSH or AMH were also controlled for 
in downstream genome-wide association (GWA) analyses. 

SNP association analysis 

GWA analyses were performed in the context of a robust regression of 
FSH or AMH versus age using the application quantreg v.4.44 (Koenker, 

2009) within R.A. Fisher's exact test for single-marker (SNP) and multiple- 
marker (haplotype) association using the allelic test was performed to 
determine associations with serum hormone levels using BEAGLE 
v.3.0.2 and quantreg v.4.44; both programs were integrated in 
R v.2. 1 I . I . The effect sizes or the difference in the mean hormone level 
from the overall regression fit, by allele or genotype were calculated for 
all SNPs. In parallel validation analyses, allelic and genotypic tests using 
the Kruskal-Wallace rank-sum test determined SNPs significantly asso- 
ciated with FSH or AMH, as quantitative variables. For each SNP allele 
and genotype, a linear regression fit of FSH or AMH versus age was 
obtained, the residual was computed for each woman, and the women 
were ranked into groups based on the allele/genotype to determine 
whether one allele/genotype had higher or lower residuals (effect sizes) 
than expected by random chance alone. Associations determined by the 
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Fisher's exact test were also significant as determined by the Kruskal- 
Wallace test, for all top hits. 

LD of variants, the number of SNPs involved in the association signals, 
the genes and the recombination rates of the regions surrounding the top 
hits were analyzed within BEAGLE, Haploview v.4.2 (http://www. 
broadinstitute.org/mpg/haploview) (Barrett et ai, 2005) and LocusZoom 
v. I . I (http://csg.sph.umich.edu/locuszoom/) (Pruim et ai, 2010). LD 
tagging was performed with the Tagger program (de Bakker et ai, 2005) 
within Haploview to capture SNPs with squared correlation coefficients 
of R 2 > 0.8 and LD of flanking SNPs was similarly analyzed within 
LocusZoom. 

To verify associations and determine final significance levels, permuta- 
tion testing (1000 tests) was performed and P< 0.05 were selected for 
at the chromosome-wide level. Bonferonni correction for multiple 
testing and family-wise errors rates were computed to verify significant 
associations at the genome-wide level. P-values were computed and ana- 
lyzed at both the chromosome- and genome-wide-levels, owing to our 
sample size/power constraints. Further, as several chromosomes have 
loci associated with ovarian reserve and function, variants were also ana- 
lyzed specifically at the chromosome level. Although accepted genome- 
wide significance thresholds are typically ~5 x I0~ 8 , because of our 
marginal sample size (which is large for a clinical/biological study but 
moderate for a GWA study), assuming a slightly higher genome-wide 
threshold of P < I x 1 0~ 7 , we note that the top hit identified in this 
study is at a marginal genome-wide significance level. Custom-made pro- 
grams and algorithms were used to mediate streamlined analyses and visu- 
alization of results within R. 

Results 

Genetic association analysis 

As shown in Table I, in 232 Caucasian and 200 African American 
women, the average FSH levels were 7. 1 + 0.2 and 7.4 + 0.3 IU/1, 
respectively. The mean AMH level was 30. 1 + 1 .5 pM in Caucasians 
and 22.8 + 1 .7 pM in African Americans, with a significant difference 
between the two racial groups (P < 0.00 1 ). Both hormones were cor- 
related with female age (Fig. I) and follicle number (P < 0.05). In 
Caucasian women, FSH had an extrapolated rate of increase of 
~0.27 IU/1 per year of female age (Fig. I A). In African American 
women, similar relationships were observed, with a rise of 
~0. 16 IU/1 of FSH per year of female age (Fig. I A). AMH was most 
closely correlated with both female age (Caucasians: R 2 = 0.22; 
African Americans: R 2 = 0.043) and follicle number (Caucasians: 
R 2 = 0.67; African Americans: R 2 = 0.36; P < 0.05); AMH decreased 
with increasing age and declining AFCs. In Caucasian women, AMH 
decreased ~2 pM per year, while in African American women it 
decreased ~0.84 pM per year (Fig. IB). Both FSH and AMH levels 
were less strongly correlated with age in the African American popula- 
tion, compared with the Caucasian population (FSH: R 2 = 0.06 versus 
0. 1 5; AMH: R 2 = 0.04 versus 0.22, respectively). FSH and AMH levels 
were more variable among African American women and higher follicle 
numbers were not always predictive of higher AMH levels. 

As two good hormonal markers of ovarian reserve, FSH and AMH 
were independently analyzed for associations across the genome. 
Figure 2 shows the GWA results for FSH in African American 
(Fig. 2A and B) and Caucasian (Fig. 2C and D) women. Figure 3 
depicts the genome-wide results for AMH. In the African American 
cohort, the top three genetic loci associated with FSH levels localized 



Table I Variables of the study population. 



Values are shown as mean + SEM or percentage of the cohort. 
AMH, anti-Mullerian hormone: OCP, oral contraceptive pill. 
a Parentheses denote AFC values (mean + SEM). 

^Significant differences between Caucasian and African American women 
(P-C0.00I). 



to chromosomes 2 and 10 (Fig. 2B and Table II). Notably, one 
marginally genome-wide significant SNP (rs6543833) was associated 
with FSH in the African American population (P = 8.07 x I0~ 8 ). 
This variant, at 2p22.3, is located just upstream of the 
myeloid-associated differentiation marker-like (MYADML) gene and was 
associated with a mean increase of 1 .0 + 0.2 IU/1 and a decrease of 
0.6 + 0.3 IU/1 FSH for the CC and GG genotypes, respectively 
(MAF = 0.27; Table II and Fig. 4A). The next most significant SNP, 
rs 1 24658 1 I, at the same loci on chromosome 2, was also associated 
with FSH (P = 4.28 x 1 0" 7 ; MAF = 0.32). This variant was associated 
with an increase of 1 .0 + 0.2 IU/1 and a decrease of 0.4 + 0.2 IU/1 
FSH for the GG and TT genotypes, respectively (Table II and 
Fig. 4A). Regional association analyses of the 800 kb surrounding 
these loci indicated that these variants were in LD and several 
nearby SNPs were involved in the association signal (Fig. 2E). The 
third nominally associated variant rs I 1255291, localized to I Op 1 4, 
upstream of the inter-alpha (globulin) inhibitor H2 (ITIH2) gene and 
was associated with a decrease of 1 .3 + 0.2 IU/1 and an increase of 
1 .2 + 0.4 IU/1 FSH for the TT and CC genotypes, respectively 
(Table II). 

In the Caucasian cohort, the top three genetic loci associated with 
FSH localized to chromosomes 1 2, 5 and I I (Fig. 2C and D). As sum- 
marized in Table II, the two most significant SNPs in the Caucasian 
women, rs64886l9 and rs 1 006 1 804, were nominally associated with 
FSH (P = 9.43x I0" 7 and 1.47 x I0" 6 ; MAFs = 0.25 and 0.28) 
and localized to I2pl3.l and 5p 1 3.3, respectively. 





Caucasians 






African Americans 


Mean ± SEM 


n 


wean 3cn 


n 


Age (years) 


35.3 + 


0.3 


232 


35.6 ± 0.4 


200 


FSH (IU/1) 


7.1 ± 


0.2 


232 


7.4 ± 0.3 


200 


AMH (pM) 


30.1 ± 


1 .5* 


232 


22.8 + 1 .7* 


200 


LH (IU/1) 


5 8 + 


0 2* 


232 


4.5 + 0.2* 


200 


Inhibin B 


54.9 + 


1 .9* 


232 


44.4 ± 2.4* 


200 


(pg/ml) 












Estradiol 


48.3 + 


3.6 


232 


45.7 ± 1.7 


200 


(pg/ ml) 












Estrone 


61.7 + 


1.6 


232 


63 3 + 1 6 


200 


(pg/ml) 












Total 


52.9 + 


1 .0* 


232 


45.3 ± 1.0* 


200 


testosterone 
























Past/current 


45.5% (15.9 + 


1.0) 


06 


23.3% (14.3 + 1.4) 


47 


smokers 3 












Non-smokers 3 


54.5% (14.9 + 


0.7) 


127 


76.6% (16.0 + 0.9) 


154 


OCP users 3 


79% (14.9 + 


0.6) 


195 


72% (14.3 + 0.8) 


145 


Non-OCP 


21% (17.2 + 


1.3) 


48 


28% (18.9 + 1.6) 


56 


users 3 
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FSH - Caucasians 



FSH - African Americans 




B 



AMH - Caucasians 



35 

Age (years) 

AMH - African Americans 




Age (years) 

Figure I Measurements of FSH and AMH versus age in women of the entire study population. Total serum concentrations of FSH and female age in 
Caucasian and African American women indicate that FSH increases with age in both ethnic groups (A). Total serum concentrations of AMH versus 
age indicate that AMH decreases with age but is highly variable between women and is more variable among African American women (B). The 
corresponding correlation coefficients (R 2 ) and linear equations are shown (n = 200-232 women). 



Variant rs64886l9, is located within an intron of the glutamate recep- 
tor, ionotropic, N-methyl D-aspartate 28 (GRIN2B) gene and led to a 
mean decrease of I.I + 0.4 IU/1 and an increase of 1 .2 + 0.3 IU/1 
FSH for the CC and TT genotypes, respectively. Analysis of the sur- 
rounding region indicated that several variants at this loci were asso- 
ciated with FSH and a few of these were in LD (Supplementary 
data, Fig. S3A). Variant rs 1 006 1 804 on chromosome 5 is located 
within an intron of the natriuretic peptide receptor C/guanylate cyclase 
C (NPR3) gene and was associated with increases of 2. 1 + 0.8 and 
1.1+ 0.4 IU/1 FSH for the TT and CT genotypes, respectively. 
There were several SNPs in close proximity at the 5p 1 3 .3 region 
that approached chromosome-level significance (see Fig. 2B). The 
last SNP, rs4944804 at I I q 1 3.4, weakly associated with FSH, is 
located immediately upstream of the ATG autophagy related 16-like 2 
(Saccharomyces cerevisiae) (ATG 1 6) gene and was associated with an 
increase of 1 .2 + 0.4 IU/1 FSH for the GG genotype. 

There were fewer genetic variants associated with AMH compared 
with FSH, in both Caucasian and African American women. In the 
African American cohort, weakly associated variants localized to 
3p25. 1 and I I p 1 5. 1 near or within the tetra-peptide repeat homeobox- 
like (TPRXL) and transmembrane protein 86A (TMEM86A) genes, 
respectively (Table III and Fig. 3A and B). The top genetic loci in 
the Caucasian cohort were located on chromosome 6, with only 
very weakly associated variants on chromosomes 18, 4 and 13 
(Fig. 3C and D). The most significant SNP rs 1 22 1 3875, which was 
nominally associated with AMH (P = 2.96 x 1 0" 7 ; MAF = 0.37) loca- 
lized to 6p23 and is upstream of the Jumonji, AT rich interactive domain 2 
(JARID2) gene. This variant was associated with a mean increase of 
1 3.8 + 4.2 pM, an increase of 9.9 + 2. 1 pM and a decrease of 
13 + 1 .4 pM AMH for the AA, AG and GG genotypes, respectively 



(Table III and Fig. 4B). Several variants adjacent to the top SNP asso- 
ciated with AMH in both ethnic groups showed some degree of LD 
and many variants in these regions were associated with the signals 
(Supplementary data, Fig. S4). Many other SNPs were only weakly 
associated with AMH levels and are located within or near 
the thioredoxin-related transmembrane protein 3 (TMX3), inositol 
polyphosphate-4-phosphatase, type II (INPP4B) and WAS protein family, 
member 3 (WASF3) genes. Interestingly, several variants associated 
with FSH or AMH (this study), or AFC (previous unpublished 
work), localized to the same genomic regions of I2pl3. 1 — 13.2 and 
I3ql2.l3. 

In replication studies, the top variants associated with FSH were 
examined in the Caucasian population as an independent replication 
cohort. The top SNP rs6543833, had a P-value of 0.012 with an 
MAF of 032, while the next most significant SNP rs 1 24658 1 I, had 
a P-value of 0.071 with an MAF of 0.33 (Supplementary data, Table 
SI). When only controlling for these two tests, the top SNP was 
significant in the replication cohort at a P < 0.05 level. As shown in 
Supplementary data, Table SI, the 300 SNPs flanking this variant 
were also examined and it was discovered that nine adjacent SNPs 
had relatively low P-values of 1.5 x I0~ 3 to 7.2 x I0~ 3 . The regional 
plots of these variants and their association signals are also indicated in 
Supplementary data, Fig. S3B. Comparing the expected versus the 
observed numbers of SNPs in this region with these low P-values indi- 
cates that this cluster of SNPs was unlikely the result of random 
chance. There were three times as many SNPs with low P-values 
than expected based on observed P-value distributions. These findings 
provide confirmation of the top SNP as well as other associated loci in 
the surrounding region. Interestingly, all of these variants occurred 
within or upstream of the MYADML gene, which may represent an 
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Figure 2 Summary of GWA results for FSH by chromosome showing Manhattan plots of the uncorrected (A and C) and corrected (B and D) 
P-values of the Fisher's exact test in the African American (A and B) and Caucasian (C and D) populations. Each point represents a SNP from the single 
SNPs and haplotype tests of association remaining after QC and SNP filtering (677 26 1 in the Caucasian cohort and 738 1 85 in the African American 
cohort). The red dashed lines indicate P-values of 1 .5 x I0~ 6 (uncorrected) and 0.05 (corrected), chromosome-wide significance level. (E) Regional 
association analyses of 800 kb (272 SNPs) flanking the top variant associated with FSH in the African American cohort (rs6543833), including the 
number of SNPs involved in the signal, their P-values, the LD among these variants, genes in the region and the recombination rates (hg build 18, 
HapMap Phase II CEU reference, legend showing LD). 



excellent candidate gene for future studies in women of both 
European and African ancestry. Replication of these findings in inde- 
pendent cohorts of reproductive-aged women and meta-analyses in 
larger combined GWA studies are needed. 

In an attempt to further examine the genetic variants associated 
with FSH and AMH, we tested the top hits for association with 
another well-known marker of ovarian reserve — AFC. A total of 



four and five SNPs associated with FSH or AMH levels in Caucasian 
or African American women, respectively, were tested for association 
with AFC. Final P-values were corrected for the number of tests and 
considered significant at a level of P < 0.05. Effect sizes in terms of 
direction (higher or lower FSH/AMH levels or AFCs) and magnitude 
(total differences in the mean hormone concentration or mean 
number of follicles from the overall regression fit) for the effect 
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Figure 3 Summary of GWA results for AMH by chromosome showing Manhattan plots of the uncorrected (A and C) and corrected (B and D) 
P-values of the Fisher's exact test in the African American (A and B) and Caucasian (C and D) populations. Each point represents a SNP from the single 
SNPs and haplotype tests of association remaining after QC and SNP filtering (677 261 in the Caucasian cohort and 738 185 in the African American 
cohort). The red dashed lines indicate P-values of 1.5 x I0~ 6 (uncorrected) and 0.05 (corrected), chromosome-wide significance level. 



allele/genotype were compared for all variants tested. Interestingly, as 
shown in Table IV several variants associated with FSH or AMH were 
also associated with the number of antral follicles. The most significant 
variant, rs6543833, was associated with a decrease of 0.62 + 
0.34 IU/1 FSH and an increase of 1.09 + 0.32 antral follicles for the 
GG genotype (P = 9.25 x I0~ 3 ). Variant rs9875589 associated with 
AMH in African Americans, had the strongest association with follicle 
count; the effect allele, C, led to an increase of 14.02 + 3.72 pM AMH 
and a corresponding increase of 4.88 + 0.63 follicles (P = 1 .84 x 
I0~ 4 ). Notably, in almost all cases the magnitudes and directions of 
effects were as expected, with variants associated with lower FSH 
levels correlated with higher follicle counts and variants associated 
with higher AMH levels correlated with higher follicle counts. Control- 
ling for follicle number in the GWA of FSH/AMH did not affect the 
top associations. Further, there were both shared and independent 
associations between the hormonal markers and follicle number, indi- 
cating that both of these traits are independently associated with 
distinct genetic variants and are significant markers of ovarian reserve. 

Phenotypes and covariates of ovarian reserve 

Serum levels of various circulating hormones potentially related to 
ovarian function and OVA were measured as shown in Table I. The 
means and ranges of hormone levels were within the spectrum of 
normal variation. There were significant differences between the 



hormone profiles of the two racial groups, with pronounced 
differences in AMH (P = 0.00074), LH (P=2.28 x I0" 5 ), inhibin B 
(P = 0.00049) and testosterone (P = 0.0004 1 ). All of these hormones 
were significantly lower in the African American women. The associa- 
tions between environmental variables including smoking and OCP 
use, and ovarian reserve markers were also analyzed. Among the 
Caucasian population, 45.5% were either current or previous 
smokers (14.6 current and 30.9% previous smokers) with an overall 
average of 9.6 + 0.6 years of smoking and a range of I -25 years. 
Fewer African Americans were current or previous smokers with 
only 23.3% being current (11.4%) or previous (11.9%) smokers, 
with an overall average of 9.8 + 1 .2 years of smoking and a range of 
I -27 years. There was no significant difference between the mean 
AFCs of women that were current or previous smokers versus that 
of non-smokers in the Caucasian cohort (P = 0.36) nor in the 
African American cohort (P = 0.34). In the Caucasian population, 
when controlling for age there was no correlation between the total 
pack-years of smoking and follicle number (R 2 = —0.057, P > 0.05). 
However, there was a trend for decreased follicle numbers among 
the smoking versus non-smoking African American women (14.3 + 
1.4 versus 16.0 + 0.9, respectively). Although, owing to the small 
number of African American smokers further comparisons were not 
feasible. 

As shown in Fig. 5, the circulating serum levels of several reproduct- 
ive and sex steroid hormones were significantly different between 
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smoking and non-smoking Caucasian women. Testosterone levels 
were 56+ 1.6 versus 50.3+ 1 .2 ng/dl (P = 0.0035), AMH levels 
were 33.5 + 2.4 versus 27.4 + 1 .7 pM (P = 0.038) and E, 
levels were 65.2 + 2.8 versus 58.8 + 1.7 pg/ml (P= 0.043), among 
smoking and non-smoking women, respectively (Fig. 5G). The regres- 
sion fit of these and other hormones with female age was also signifi- 
cantly different between smoking and non-smoking women, with the 
most pronounced differences in E 2 and testosterone levels (P = 
0.0045 and 0.0053, respectively; Fig. 5E and F). The regression of 
AFC versus age was also significantly different between smoking and 
non-smoking women, with a greater extrapolated rate of follicle 
decline among smoking women (P = 0.039; Fig. 5A). 

Seventy-nine percent of the Caucasian women and 72% of the 
African American women had previously used OCPs. Total months 
of use were 63.0 + 4.2 (range = 1-216 months) in the Caucasian 
population and 56.0 + 5.4 (range = I -264 months) in the African 
American population. In the Caucasian women, there were no statis- 
tically significant differences between the mean serum hormone levels 
or AFCs of women who previously took OCPs versus those who did 
not (P > 0.05). There was also no dose-response or association 
between the total number of months of OCP use and the total 
number of follicles (R 2 = 0.0 1 4 and I x I0~ 6 in Caucasians and 
African Americans, respectively). In contrast, in the African American 
population there was a significant increase in the number of follicles in 
women who had never taken OCPs (P = 0.0046; Fig. 5H). In both 
ethnic groups, there were trends for increased follicle numbers in 
women who had never taken OCPs compared with those who had 
(Caucasians: 17.2+ 1.3 versus 14.8 + 0.6, respectively, P= 0.058; 
African Americans: 18.9+1.6 versus 14.3 + 0.8, P= 0.0046; 
Fig. 5H). There were similar trends with several reproductive hor- 
mones and AFC regressed on age, between previous OCP-users 
and non-OCP-users (Supplementary data, Fig. S5). Despite the 
effects of smoking and OCP use on ovarian reserve phenotypes, 
adjusting for them in linear regression analyses did not affect the 
genetic associations. This indicates that the effect of each variant 
likely occurs directly on ovarian reserve/hormone levels, rather than 
through one of these covariates. 

Discussion 

Genetic variants associated with FSH 
and AMH 

As FSH and AMH are two established clinical markers of ovarian 
reserve and are closely associated with the number of antral follicles 
within the ovaries, we aimed to determine whether they might be 
associated with underlying genetic loci. Identification of these loci 
could provide markers and long-range predictors of poor ovarian 
reserve and infertility. Here, we used a genome-wide approach to 
analyze genetic variants associated with these reproductive hor- 
mones, as surrogate measures of ovarian reserve in a multi-ethnic 
fertile population of women. We identified I SNP, rs6543833, 
approaching genome-wide significance at P=8.07x I0 -8 and 
several nominal SNPs associated with FSH levels in African American 
and Caucasian women, respectively, whereas, only nominally signifi- 
cant variants were associated with AMH levels. In validation 
studies, the top hit associated with FSH was significant in the 
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Figure 4 Phenotypes and genotypes of SNPs associated with FSH and AMH. Shown are scatter plots depicting high (H) and low (L) FSH and AMH 
by age in African American (A) and Caucasian (B) women. Shown is the distribution of genotypes among the women with respect to age and FSH or 
AMH for the most significant variants: (a) rs6543833 and (b) rs 1 24658 1 I (for FSH) and (c) rs I 22 1 3875 (for AMH). The three genotypes are desig- 
nated by blue, orange and red points (n = 200-232 women). 



other ethnic cohort (Caucasians). Additionally, there were nine adja- 
cent SNPs in the nearby upstream region with significantly low 
P-values, and all were within or near the MYADML gene. We also 
found that several top hits associated with FSH or AMH were also 
significantly associated with follicle number, with similar magnitudes 
and directions of effects. These results provide validation of the asso- 
ciation and evidence for underlying genetic associations between 
these three phenotypes, as well as their ability to serve as independ- 
ent but also overlapping ovarian reserve markers. Moreover, several 
variants associated with AMH or FSH (this study), and follicle count 
(unpublished data), localized to the same genomic regions of 
1 2p 1 3. 1 - 1 3.2 and 1 3q 1 2. 1 3, which may be a hot spot for causative 
alleles linked with ovarian reserve. 

None of the variants were within or close to known ovarian genes 
and therefore identify excellent candidate genes for further investiga- 
tions. While these findings are interesting and the largest study to 
date on genetic markers of FSH and AMH, future meta-analyses and 
additional validation in independent studies are needed. To our knowl- 
edge, this is the only study to genotype and examine ovarian reserve 
hormones in a fertile population of women. All previous cohort 
studies that have analyzed some aspect of ovarian reserve have utilized 
either post-menopausal women (over age 45 years) or women 
recruited through infertility/reproductive clinics and are therefore 
not reproductively 'normal' populations. This work is the first to 
analyze a cohort of women (non-menopausal) of reproductive age 
recruited from the general population and not seeking infertility 



treatments. This study is also the first to examine these reproductive 
traits in women of African as well as European ancestry, and may help 
to provide a baseline and foundation for future study of these ethnic 
groups. 

As indicated by our previous work in a larger population of 245 
Caucasian women, and validated here in both Caucasians and 
African Americans, of all hormonal, anthropometric and environmen- 
tal variables, AMH and FSH were most closely associated with female 
age and follicle number. AMH paralleled the pattern observed 
between follicle number and age both in vivo and in histologic 
ovarian specimens (Hansen et a/., 2008), while FSH was inversely 
related. Consistent with the role of AMH, we found that in Caucasian 
and African American women AMH and follicle number were closely 
related and both declined with female age. Interestingly, AMH levels 
were more variable among African American women compared 
with Caucasian women, and showed a weaker correlation with age. 
As predicted, FSH levels increased with both female age and decreas- 
ing follicle number in both ethnic groups, and surprisingly, were also 
more variable and less strongly correlated with age among African 
American than Caucasian women. It is interesting to note that none 
of the subjects had polycystic ovary syndrome or other reproductive 
disorders, which could be associated with elevated AMH or FSH 
levels. All women also had regular cycles and no hyperandrogenemia. 
Therefore, the greater variability cannot be explained by ovarian or 
other hormonal disorders. To our knowledge there is no other 
study with which to compare these results and no other study to 
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assess these hormones in normal fertile women of African and Euro- 
pean ancestry. 

Several hormones, including FSH, AMH, E 2 and inhibin B, among 
others, have been correlated with ovarian response to controlled 
ovarian stimulation or pregnancy outcomes in IVF cycles (Eldar-Geva 
et a/., 2005; Wunder et a/., 2008; Wu et a/., 2009). Studies have 
reported that AMH may be the best marker of ovarian reserve as it 
can be measured throughout the menstrual cycle and is more 
closely associated with the number of oocytes retrieved and repro- 
ductive outcomes in IVF (Durlinger et a/., 2002; Wunder et a/., 
2008; Wu et a/., 2009). AMH also has a reported lower variability 
within and between cycles in the same women (van Disseldorp 
et a/., 2010). Previous work has also identified a few variants in the 
genes encoding the receptors for AMH and FSH, AMHR2 (Kevenaar 
et a/., 2007) and FSHR (Gromoll and Simoni, 2005; Loutradis et a/., 
2006), and found correlations with various reproductive parameters 
including age at menopause, or FSH sensitivity and menstrual cycle 
length, respectively. Interestingly, a recent large-scale candidate gene 
association study found that the group of genes involved with 
steroid hormone metabolism and biosynthesis pathways including 
FSHB, was significantly associated with age of menopause (He et a/., 
2010). This indicates that several reproductive (i.e. FSH) and sex 
steroid hormones may share underlying genetic associations that are 
linked with the reproductive lifespan. However, many of these 
studies relied on comparisons of hormone profiles and their ability 
to predict reproductive outcomes in the context of small populations 
of infertile or subfertile (or post-menopausal) women. Thus, we now 
identify genetic variants associated with markers of ovarian reserve in 
a large community-based population. 

None of the genes associated with the identified variants have 
demonstrated roles in ovarian biology and are therefore interesting 
candidate genes. As FSH and AMH are related to ovarian function 
and reflective of ovarian reserve, the genes and variants associated 
with them might have roles in the brain-pituitary-ovary axis and be 
expressed within the ovary or brain — and some of them are. The 
two top variants, as well as other nearby variants associated with 
FSH are located upstream or within the MYADML gene at 2p22.3. 
MYADML has almost exclusive expression in the human testis and 
limited expression in the brain (www.ncbi.nlm.nih.gov/unigene). 
However, the role of this gene in ovarian biology is unknown. The 
GRIN2R gene containing the top SNP at I2pl3.l associated with 
FSH in Caucasian women, encodes one subunit of the ionotropic 
NMDA glutamate receptor, which is involved in learning and 
memory and expressed within the brain and testis. Genetic association 
studies have implicated this gene in several neurological diseases (Allen 
et ai, 2008), however studies of this receptor in ovarian function are 
lacking. The two variants marginally associated with AMH in Caucasian 
women are located upstream of the JARID2 gene at 6p23. JARID2 is an 
ortholog of the mouse jumonji gene, which encodes a DNA-binding 
nuclear protein and regulator of histone methyltransferases that is 
essential for embryogenesis and differentiation. JARID2 negatively reg- 
ulates cell growth and proliferation and is ubiquitously expressed, with 
expression in both human and mouse oocytes and ovary, and human 
embryonic stem cells (Ovarian Kaleidoscope: http://ovary.stanford. 
edu/) (Ben-Shlomo et ai, 2002). Therefore, JARID2 could play a 
role in germ cell and/or somatic cell growth within the developing 
ovary. 
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Table IV Associations between FSH- and AMH-related SNPs and AFC in African American and Caucasian women. 



SNP a 


Race 


Gene 


Alleles'" 


MAF 


FSH/AMH genotype effect 
(concentration)' 


AFC genotype effect 
(follicles) 


P-value e 


rs6543833 


Afr Amer 


MYADML 


G/C 


0.27 


-0.62 ± 0.34 (FSH) 


+ 1.09 + 0.32 


1.85 x I0~ 3 
(9.25 x I0~ 3 )* 


rs 1246581 1 


Afr Amer 


MYADML 


T/G 


0.32 


-0.38 ± 0.24 (FSH) 


+0.94 + 0.25 


1.31 x I0~ 3 
(6.55 x I0~ 3 )* 


rs9875589 


Afr Amer 


TPRXL 


A/C 


0.46 


+ 14.02 ± 3.72 (AMH) 


+4.88 ± 0.63 


3.68 x I0~ 5 
(1 .84 x IU f 


rs 12295403 


Afr Amer 


TMEM86A 


T/G 


0.10 


+ 16.67 + 5.1 1 (AMH) 


+5.17 + 0.86 


3.34 x I0~ 3 
(1.67 x I0~ 2 )* 


rsl 1255291 


Afr Amer 


mm 


T/C 


0.25 


- 1 .34 + 0.22 (FSH) 


+ 1.31 +0.36 


0.21 1 (1) 


rs64886l9 


Caucasian 


GRIN2B 


C/T 


0.25 


+ 1.16 + 0.33 (FSH) 


+0.55 ± 0.049 


0.104 (0.416) 


rsl 006 1804 


Caucasian 


NPR3 


T/C 


0.28 


+2.05 + 0.80 (FSH) 


+ 1 .44 + 0.30 


0.310 (1) 


rsl 22 13875 


Caucasian 


JARID2 


A/G 


0.37 


+ 13.82 + 4.18 (AMH) 


+3.08 ± 0.52 


0.0514 (0.206) 


rs9396503 


Caucasian 


JARID2 


T/G 


0.37 


+ 14.92 + 4.60 (AMH) 


+3.74 + 0.66 


0.0415 (0.166) 



P < 0.05 for four out of five FSH/AMH-associated SNPs also associated with AFC, compared with 0.34 (< I ) expected by chance. 
a SNPs at each locus are those associated with serum FSH or AMH levels, rather than those with the strongest signal for follicle number. 
b Alleles are shown as minor/major allele; bold indicates the effect allele. 

c The difference in mean ( + SEM) serum FSH levels (IU/1) or AMH levels (pM) for a given genotype. 

d The difference in mean ( + SEM) number of follicles for the corresponding genotype calculated from the regression analysis. 

e P-values are based on the Fisher's exact test for SNP association with the hormone; parentheses denote corrected P-values; *P < 0.05. 



Interestingly, several variants associated with FSH or AMH were 
also associated with AFC and variants associated with all three 
ovarian reserve markers localized to some of the same genomic 
regions. A variant associated with AMH (rsl 7084228) and a variant 
associated with AFC, were located within the same 30-kb genomic 
region of I3ql2.l3, where the nearest genes are GPRI2 and 
WASF3. Both genes are predominantly expressed in the brain, with 
WASF3 also highly expressed in the ovary. One SNP associated with 
FSH (rs64886l9), and two other SNPs associated with AFC, were 
also located in a similar genomic region at I2pl3.l and I2pl3.2, 
respectively. Therefore, these genomic regions on chromosomes 12 
and 13, which also comprised long stretches of associated SNPs in 
LD, may be hot spots for causative alleles or genes associated with 
ovarian reserve. Further, as all of these ovarian reserve markers had 
both distinct and shared genetic loci, this emphasizes the multi- 
factorial and polygenic nature of ovarian reserve and female fertility. 
Future functional studies may examine the expression of these genes 
in human ovarian tissue and female germ cells, altered gene expression 
in ovarian tissue from women with different genotypes, and genetic 
over-expression and silencing studies in ovarian and stem cell lines. 
This work will help to elucidate the expression patterns and functional 
roles of these candidate genes in human ovarian biology. 

Environmental factors associated with 
ovarian reserve 

Environmental factors can affect human reproductive traits. We dis- 
covered modest effects of smoking and OCP use on clinical markers 
of ovarian reserve, with alterations in several reproductive and sex 
steroid hormones, as well as follicle numbers. Smoking has adverse 
effects on fertility and pregnancy. There is continuing debate about 



the relationship between cigarette smoke and ovarian failure. Studies 
have reported an association between smoking and an earlier age at 
natural menopause, with women who smoke entering menopause 
1-2 years earlier than comparable non-smokers (Torgerson et al., 
1994; Bromberger et al., 1997; van Noord et al., 1997; Cooper 
et al., 1999; Gold et al., 2001). Former smokers also reportedly 
have a slightly earlier age of menopause than non-smokers (Willett 
et al., 1983; Cooper et al., 1999) and women prenatally exposed to 
maternal cigarette smoke may enter menopause sooner (Strohsnitter 
et al., 2008). Limited studies on special patient/infertility populations 
have suggested that smoking is not associated with AFC or AMH 
levels (Kinney et al., 2007; Nardo et al., 2007) but is associated with 
elevated FSH levels (Kinney et al., 2007). 

It is plausible that tobacco toxins have detrimental effects on the 
follicular pool and increase rates of follicular atrophy and atresia, 
leading to lower AFCs, alterations in ovarian reserve hormones, and 
ultimately, reduced fertility and an earlier menopause. We found 
that smoking was associated with increased serum levels of testoster- 
one, E|, E 2 , AMH and a trend for increased FSH levels. This under- 
scores the potential effects of smoking on the ovarian reserve and 
suggests that smoking may alter steroidogenesis, OVA and/or the 
hypothalamic-pituitary-ovarian axis. Indeed, previous work has found 
that levels of sex steroid hormones, such as testosterone, are signifi- 
cantly elevated in smokers (Deslypere and Vermeulen, 1984; Dai 
et al., 1988), while other studies have reported alterations in sex 
steroids including lower concentrations of E h E 2 and estriol (Soldin 
et al., 201 I). 

Other effects of tobacco toxins include increased rates of apoptosis 
and necrosis in various tissues of the body, as well as increased apop- 
tosis in primordial germ cells differentiated from human stem cells in 
vitro (Kee et al., 2010). These effects might also manifest as increased 
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Figure S The association between smoking or oral contraceptive use and several reproductive hormones and follicle counts. Shown are serum 
hormone levels or AFCs versus age for women that were current or previous smokers (red points) or non-smokers (black points) (A-F). The 
data for all women were fit with a linear regression (gray line) and the regressions for smoking (red line) and non-smoking (black line) women 
were compared. P-values indicate a significant change in slope, intercept or both slope and intercept, associated with smoking. There were significant 
differences in serum levels of estrone (E|), estradiol (E 2 ) and testosterone (T), and AFC between smoking and non-smoking women (P < 0.05). Also 
shown are mean serum levels of testosterone, E| and AMH in smoking (closed bars) and non-smoking (open bars) women (G) and mean AFCs in oral 
contraceptive-using (open bars) and non-using (closed bars) African American and Caucasian women (H) (*P < 0.05). 



rates of follicular apoptosis or dysfunction within the ovaries. Indeed, 
polycyclic aromatic hydrocarbons in cigarette smoke are toxic to 
ovarian follicles in both animal models (Essenberg et a/., 1951) and 
humans (Mattison and Thorgeirsson, 1978). We found that neither 
previous nor current smoking was associated with overall lower follicle 
counts. However, smoking was associated with an increased extrapo- 
lated rate of follicular decline in Caucasian women (P = 0.039). There 
was no dose-response between the pack-years of smoking and follicle 



number (P > 0.05). Our findings are consistent with the work of Gold 
and co-workers, one of the largest, most comprehensive studies on 
menopause (Gold et at., 2001). In 14 620 middle-aged women of 
the Study of Women's Health Across the Nation, smoking was asso- 
ciated with an earlier age of menopause (0.3-1.2 years earlier) but 
there was no dose- response effect of the amount of cigarettes 
smoked per day on menopausal age. Similarly, we found no dose- 
response of smoking but smoking was positively correlated with an 
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increased rate of follicular decline. There was also a positive correl- 
ation between female age and number of years of smoking, with 
older women on average having smoked for a greater number of 
years. Any effects of smoking on the follicle pool therefore, might 
be most evident in mid-life. 

A slightly later age of menopause has been reported in women that 
previously used OCPs (van Noord et at., 1997; Gold et at., 2001), 
while other studies have reported no association (Bromberger et a/., 
1997; Hardy and Kuh, 1999). It has been hypothesized that OCP 
use delays menopause by the hormonal actions of suppressing follicu- 
lar recruitment and ovulation, thereby leading to increased numbers of 
follicles. In contrast, we found that OCP use did not increase follicle 
number. Instead, women who had taken OCPs had similar or slightly 
lower follicle numbers compared with women who had never taken 
OCPs (P = 0.058 in Caucasians and P = 0.0046 in African Amer- 
icans). Similar to previous work on menopause (Gold et a/., 2001), 
we found no relation between duration of prior OCP use and follicle 
number or serum hormone levels. These results suggest that OCP use 
is not associated with increased follicle number or alterations in 
ovarian reserve hormones. It remains speculative that oral contracep- 
tives may actually have some subtle inhibitory effects on follicle 
numbers or ovarian reserve in women of reproductive age. 
However, as we are limited by sample size, longitudinal studies analyz- 
ing these lifestyle factors in larger cohorts of women could provide 
further insight on the biological effects of these exposures on the 
ovarian reserve. It is likely that complex genetic and environmental 
influences affect follicular atresia, as well as the brain-pituitary-ovary 
axis and its regulation of gonadotrophins and sex steroid hormones. 

Conclusion 

This is the first genome-wide study to evaluate genetic associations 
with hormone markers of ovarian reserve. We found several differ- 
ences between the ovarian reserve phenotypes of Caucasian and 
African American women — differences in AMH levels and subtle dif- 
ferences in the associations between hormonal markers with age 
and follicle number. Smoking and OCP use were associated with 
alterations in reproductive hormone profiles and/or follicle number 
in both ethnic groups. We identified one variant of marginal genome- 
wide significance and several nominally-significant variants associated 
with FSH levels in African American women, as well as nominal var- 
iants associated with FSH and AMH levels in both ethnic groups. 
These variants were validated in the Caucasian cohort. Several of 
the variants were also significantly associated with follicle number. 
Interestingly, none of the associated genes have previously demon- 
strated roles in ovarian function and now represent a group of candi- 
date genes for further investigation. Future studies including multi-site 
meta-analyses on additional larger cohorts of women and replication 
in independent studies are needed. Fine mapping or sequencing and 
functional in vitro gene expression and gain- and loss-of-function 
studies on the identified candidate genes might reveal any effects of 
the genetic variants and the putative functions of these genes. Longi- 
tudinal studies are also needed to explore the effect of these variants 
and genes on the decline in ovarian reserve with age. 

A better understanding of the genes and environmental factors that 
impact ovarian reserve may provide greater insight on the process of 
reproductive aging, the genetic requirements of human fertility and risk 



for development of disease. It is hoped that this work may help form a 
foundation for genetically identifying women at risk for early ovarian 
failure, fertility defects and hence associated reproductive and 
somatic diseases. 

Supplementary data 

Supplementary data are available at http://humrep.oxfordjournals.org/. 
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